English-Korean Patent Translation System: FromTo-EK/PAT
Abstract
This paper presents a method for customizing an English-Korean machine translation system from the general domain to the patent domain. The customization involves four steps: (1) extracting and constructing a large bilingual terminology and patent-specific translation patterns, (2) adapting the probabilities of a POS tagger trained on the general domain to the patent domain, (3) syntactically analyzing long and complex sentences by recognizing coordinate structures, and (4) selecting a proper target word using a patent-specific bilingual dictionary and collocation knowledge extracted from a patent corpus. The customized English-Korean patent translation system achieves an average translation accuracy of 82.43% across five patent categories (machinery, electronics, chemistry, medicine, and computer), as evaluated by seven professional patent translators. A patent MT system for the electronics domain has been installed and has started an online MT service at IPAC (International Patent Assistance Center) under MOCIE (Ministry of Commerce, Industry and Energy) in Korea. In 2007, KIPO (Korean Intellectual Property Office) is expected to launch its English-Korean MT service for the whole patent domain.
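As a rough illustration of step (4), target word selection with a bilingual dictionary and collocation knowledge can be sketched as follows. This is a minimal sketch under stated assumptions, not the system's actual method: the dictionary entries, the collocation counts, and the function name `select_target_word` are all hypothetical, and scoring candidates by summed co-occurrence counts is a simplification chosen for the example.

```python
from collections import Counter

# Hypothetical collocation counts: (source context word, Korean candidate) -> frequency.
# The numbers are invented for illustration and do not come from the paper.
COLLOCATIONS = Counter({
    ("circuit", "전류"): 12,
    ("circuit", "현재"): 1,
    ("date", "현재"): 9,
})

# Hypothetical bilingual dictionary; candidates are listed in default-preference order.
DICTIONARY = {"current": ["현재", "전류"]}


def select_target_word(source_word, context_words):
    """Pick the candidate translation with the most collocation evidence."""
    candidates = DICTIONARY.get(source_word)
    if not candidates:
        # Unknown word: pass it through untranslated.
        return source_word

    def score(candidate):
        # Counter returns 0 for pairs that were never observed.
        return sum(COLLOCATIONS[(ctx, candidate)] for ctx in context_words)

    best = max(candidates, key=score)
    # With no collocation evidence at all, fall back to the first dictionary entry.
    return best if score(best) > 0 else candidates[0]
```

In this toy setup, the word "current" resolves to the electrical sense when "circuit" appears in the context and to the temporal sense when "date" does; a real system would draw both resources from a patent corpus.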
Similar resources
Applying Statistical Post-Editing to English-to-Korean Rule-based Machine Translation System
Conventional rule-based machine translation systems suffer from weak fluency in target language generation. In particular, when translating spoken English into Korean, the fluency of the translation is as important as its adequacy for readability and understanding. This problem is more severe in language pairs such as English-Korean. It’s because Englis...
Customizing an English-Korean Machine Translation System for Patent Translation
This paper addresses a method for customizing an English-to-Korean machine translation system from the general domain to the patent domain. The customizing method consists of the following steps: 1) linguistically studying the characteristics of patent documents, 2) extracting unknown words from large patent documents and constructing a large bilingual terminology, 3) extracting and constructing the patent...
FromTo/JK: A Japanese-Korean Machine Translation System
FromTo/JK is a Japanese-to-Korean machine translation system based on the direct translation approach, motivated by the syntactic similarity between Japanese and Korean. To resolve target word ambiguity, the transfer step uses collocation information of nouns and postpositions and case frame information of verbs. It was evaluated against existing MT systems and showed the best performance.
Customizing an English-Korean Machine Translation System for Patent/Technical Documents Translation
This paper addresses a method for customizing an English-Korean machine translation system from the general domain to the patent or technical document domain. The customizing method includes the following: (1) adapting the probabilities of a POS tagger trained on the general domain to the specific domain, (2) syntactically analyzing long and complex sentences by recognizing coordinate structures, and (3) ...
Hybrid Approaches to Improvement of Translation Quality in Web-based English-Korean Machine Translation
The previous English-Korean MT system, a transfer-based system applied only to written text, listed the following problems that did not seem easy to solve in the near future: 1) processing of non-continuous idiomatic expressions 2) reduction of too many ambiguities in English syntactic analysis 3) robust processing for failed or ill-formed sentences 4...